Picture for Di Lu

Di Lu

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Add code
Jan 17, 2026
Viaarxiv icon

Sharp Eyes and Memory for VideoLLMs: Information-Aware Visual Token Pruning for Efficient and Reliable VideoLLM Reasoning

Add code
Nov 11, 2025
Viaarxiv icon

CEHA: A Dataset of Conflict Events in the Horn of Africa

Add code
Dec 18, 2024
Figure 1 for CEHA: A Dataset of Conflict Events in the Horn of Africa
Figure 2 for CEHA: A Dataset of Conflict Events in the Horn of Africa
Figure 3 for CEHA: A Dataset of Conflict Events in the Horn of Africa
Figure 4 for CEHA: A Dataset of Conflict Events in the Horn of Africa
Viaarxiv icon

From Prohibition to Adoption: How Hong Kong Universities Are Navigating ChatGPT in Academic Workflows

Add code
Oct 02, 2024
Viaarxiv icon

AKEM: Aligning Knowledge Base to Queries with Ensemble Model for Entity Recognition and Linking

Add code
Sep 13, 2023
Figure 1 for AKEM: Aligning Knowledge Base to Queries with Ensemble Model for Entity Recognition and Linking
Figure 2 for AKEM: Aligning Knowledge Base to Queries with Ensemble Model for Entity Recognition and Linking
Viaarxiv icon

FATRER: Full-Attention Topic Regularizer for Accurate and Robust Conversational Emotion Recognition

Add code
Jul 23, 2023
Viaarxiv icon

Event Extraction as Question Generation and Answering

Add code
Jul 10, 2023
Viaarxiv icon

A New Task and Dataset on Detecting Attacks on Human Rights Defenders

Add code
Jun 30, 2023
Figure 1 for A New Task and Dataset on Detecting Attacks on Human Rights Defenders
Figure 2 for A New Task and Dataset on Detecting Attacks on Human Rights Defenders
Figure 3 for A New Task and Dataset on Detecting Attacks on Human Rights Defenders
Figure 4 for A New Task and Dataset on Detecting Attacks on Human Rights Defenders
Viaarxiv icon

BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics

Add code
Dec 20, 2022
Figure 1 for BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics
Figure 2 for BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics
Figure 3 for BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics
Figure 4 for BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics
Viaarxiv icon

Information-driven Path Planning for Hybrid Aerial Underwater Vehicles

Add code
Apr 08, 2022
Figure 1 for Information-driven Path Planning for Hybrid Aerial Underwater Vehicles
Figure 2 for Information-driven Path Planning for Hybrid Aerial Underwater Vehicles
Figure 3 for Information-driven Path Planning for Hybrid Aerial Underwater Vehicles
Figure 4 for Information-driven Path Planning for Hybrid Aerial Underwater Vehicles
Viaarxiv icon